Abstract: For a linear block code C, its stopping redundancy is
defined as the smallest number of check nodes in a Tanner graph
for C, such that there exist no stopping sets of size smaller than
the minimum distance of C. Schwartz and Vardy conjectured that
the stopping redundancy of an MDS code should only depend
on its length and minimum distance.

We define the $(n, t)$-single-exclusion number, $S(n, t)$, as the
smallest number of $t$-subsets of an $n$-set, such that for each
$i$-subset of the $n$-set, $i = 1, \ldots, t + 1$, there exists a $t$-subset that
contains all but one element of the $i$-subset. New upper bounds
on the single-exclusion number are obtained via probabilistic
methods, recurrent inequalities, as well as explicit constructions.
The new bounds are used to better understand the stopping
redundancy of MDS codes. In particular, it is shown that for
$[n, k = n - d + 1, d]$ MDS codes, as $n \to \infty$, the stopping
redundancy is asymptotic to $S(n, d - 2)$, if $d = o(\sqrt{n})$, or if
$k = o(\sqrt{n})$, $k \to \infty$, thus giving partial confirmation of the
Schwartz-Vardy conjecture in the asymptotic sense.

Index Terms — erasure channel, iterative decoding, MDS code, 
single-exclusion number, stopping redundancy, stopping set, 
Turan number. 



I. Introduction 

The stopping redundancy of a linear code characterizes the 
minimum "complexity" (number of check nodes) required in a 
Tanner graph for the code, such that iterative erasure decoding 
achieves performance comparable to (up to a constant factor, 
asymptotically) maximum-likelihood (ML) decoding. It can 
be viewed as a basic measure of the complexity-performance 
tradeoff in the use of redundant parity checks (RPCs) in an 
iterative decoder on the erasure channel. 

Although this tradeoff is less straightforward to understand 
in non-erasure channels, there is empirical evidence that RPCs 
can improve performance in belief-propagation decoding on an
AWGN channel [1], [2], and, recently, the concept of stopping 
redundancy has provided partial motivation for novel decoding 
algorithms that achieve near-ML performance for short, high- 
rate codes [3]. 

Formally, we define stopping redundancy as follows. Let $C$
be an $[n, k, d]$ linear code, and let $H = (h_{ij})_{l \times n}$ be a parity-check
matrix for $C$. We shall assume that $\mathrm{rank}(H) = n - k$,
but $l$ may be larger than $n - k$. The Tanner graph $G(H)$ is
a bipartite graph with $n$ variable nodes, each corresponding
to one column of $H$, and $l$ check nodes, each corresponding
to one row of $H$, such that variable node $j$ is adjacent to
check node $i$ if and only if $h_{ij} \ne 0$. A stopping set in $G(H)$
is a set of variable nodes such that all check nodes adjacent
to the set are connected to the set at least twice. It is well
known [4] that iterative erasure decoding is successful if and
only if the set of erasure locations does not contain a stopping
set. The size of a smallest non-empty stopping set, referred to
as the stopping distance and denoted by $s(H)$, is therefore an
important parameter governing the performance of the iterative
decoder. It is clear that $s(H) \le d$, and it is not difficult to see
that equality can be achieved for any code, for example by
choosing the rows of $H$ to be the non-zero codewords of the
dual code $C^\perp$. This leads to the following definition.

Definition 1 Let $C$ be a linear code with minimum distance
$d$. The stopping redundancy of $C$, denoted by $\rho(C)$, is the
smallest integer such that there exists a parity-check matrix
$H$ for $C$ with $\rho(C)$ rows, and $s(H) = d$.
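To make the notions of stopping set and stopping distance concrete, the following Python sketch finds $s(H)$ for a small parity-check matrix by exhaustive search. It is an illustration only and not part of the original development: the function names `is_stopping_set` and `stopping_distance` are hypothetical, and the search is exponential in $n$, so it is meant for toy examples.

```python
from itertools import combinations


def is_stopping_set(H, cols):
    """True if no check node (row of H) is connected to `cols` exactly once."""
    for row in H:
        hits = sum(1 for j in cols if row[j] != 0)
        if hits == 1:
            return False
    return True


def stopping_distance(H):
    """Size of a smallest non-empty stopping set in the Tanner graph of H."""
    n = len(H[0])
    for size in range(1, n + 1):
        for cols in combinations(range(n), size):
            if is_stopping_set(H, cols):
                return size
    return None  # no non-empty stopping set exists


# A parity-check matrix of the [4, 1, 4] repetition code (an MDS code).
H_rep = [
    [1, 1, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 1, 1],
]
print(stopping_distance(H_rep))  # 4
```

For this matrix the only non-empty stopping set is the full set of variable nodes, so $s(H) = d = 4$ is already achieved with $n - k = 3$ rows.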

Stopping redundancy was introduced by Schwartz and 
Vardy [5], [6], and was further studied in [7], [8]. The concept 
was later extended in a number of interesting ways [9], [10], 
[11]. Related concepts, such as stopping set enumerator, and 
generic erasure-correcting sets, were studied in [12], [13], and 
in [14], [15], respectively. 

In this paper, we focus on the special class of MDS codes.
In the rest of the paper, unless otherwise noted, $C$ denotes an
$[n, k = n - d + 1, d]$ linear MDS code. In [6], it was shown
that for all $d > 3$,

$$\frac{1}{d-1}\binom{n}{d-2} \le \rho(C) \le \frac{\max\{d^\perp, d-1\}}{n}\binom{n}{d-2}, \tag{1}$$

where $d^\perp = n - d + 2$ is the minimum distance of $C^\perp$, the dual code of $C$. The authors
of [6] then made an intriguing conjecture that $\rho(C)$ should in
fact be just a function of $n$ and $d$.

Note that the upper bound in (1) is never better than

$$\frac{1}{2}\binom{n}{d-2},$$

since $d^\perp + (d - 1) = n + 1$ forces $\max\{d^\perp, d-1\} \ge (n+1)/2$.
So the upper and lower bounds can differ by up to a factor of $n$.
In [8], we observed that the upper bound can be improved by
introducing a new combinatorial quantity, the single-exclusion
number, which we describe below. Before doing so, we first
review two related, well-studied combinatorial constructs. For
positive integers $n \ge s \ge t$, and an $n$-set¹ $N$, an $(n, s, t)$-Turán
system [16] is a collection of $t$-subsets of $N$, called blocks,
such that each $s$-subset of the $n$-set contains at least one block.
The $(n, s, t)$-Turán number, denoted hereafter by $T(n, s, t)$, is
the smallest number of blocks in an $(n, s, t)$-Turán system. A
concept "dual" to that of a Turán system is that of a covering
design [17]. Specifically, for $n \ge s \ge t$ and an $n$-set $N$, an
$(n, s, t)$-covering design is a collection of $s$-subsets of $N$, also
called blocks, such that each $t$-subset of the $n$-set is contained
in at least one block. The $(n, s, t)$-covering number, denoted
by $C(n, s, t)$, is the smallest number of blocks in an $(n, s, t)$-covering
design. Clearly, by definition,

$$T(n, s, t) = C(n, n - t, n - s).$$

¹An $n$-set is a set that contains $n$ elements. Similarly, if $A$ is any set, then
a $t$-subset of $A$ is a subset of $A$ that contains $t$ elements.
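The duality between Turán systems and covering designs amounts to taking complements of blocks: a $t$-subset $B$ is contained in an $s$-subset $X$ exactly when the complement of $X$ is contained in the complement of $B$. The short Python sketch below checks this on a toy example. The helper names are illustrative assumptions, and the ground set is taken to be $\{0, \ldots, n-1\}$.

```python
from itertools import combinations


def is_turan_system(n, s, t, blocks):
    """Each s-subset of {0..n-1} must contain at least one t-subset block."""
    blocks = [frozenset(b) for b in blocks]
    if any(len(b) != t for b in blocks):
        return False
    return all(
        any(b <= frozenset(X) for b in blocks)
        for X in combinations(range(n), s)
    )


def is_covering_design(n, s, t, blocks):
    """Each t-subset of {0..n-1} must lie inside at least one s-subset block."""
    blocks = [frozenset(b) for b in blocks]
    if any(len(b) != s for b in blocks):
        return False
    return all(
        any(frozenset(X) <= b for b in blocks)
        for X in combinations(range(n), t)
    )


def complement_blocks(n, blocks):
    """Replace every block by its complement in {0..n-1}."""
    ground = frozenset(range(n))
    return [ground - frozenset(b) for b in blocks]


turan = [{0, 1}, {2, 3}]             # a (4, 3, 2)-Turan system
cover = complement_blocks(4, turan)  # should be a (4, 4-2, 4-3)-covering design
print(is_turan_system(4, 3, 2, turan), is_covering_design(4, 2, 1, cover))
```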

The stopping redundancy of an MDS code is closely related
to covering/Turán numbers. In fact, the lower bound in (1)
was shown by noting that to maximize $s(H)$, the supports of
minimum-weight rows of $H$ must form an $(n, n-d+2, n-d+1)$-covering
design (equivalently, the complements of supports
form an $(n, d-1, d-2)$-Turán system). Hence,

$$\rho(C) \ge C(n, n-d+2, n-d+1) = T(n, d-1, d-2).$$

We now define the single-exclusion number, which was 
introduced in [18]. 

Definition 2 For an $n$-set $N$ and $t < n$, an $(n,t)$-single-exclusion
(SE) system is a collection of $t$-subsets of $N$, called
blocks, such that for each $i$-subset of $N$, $i = 1, \ldots, t+1$, there
exists at least one block that contains all but one element from
the $i$-subset. The $(n,t)$-single-exclusion (SE) number, $S(n,t)$,
is the smallest number of blocks in an $(n,t)$-SE system.
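The definition can be checked mechanically on small parameters. The following Python sketch tests whether a collection of blocks is an $(n,t)$-SE system and, for tiny $n$, finds $S(n,t)$ by brute force. The function names are hypothetical, the ground set is assumed to be $\{0, \ldots, n-1\}$, and the exhaustive search is only feasible for very small cases.

```python
from itertools import combinations


def is_se_system(n, t, blocks):
    """Check the single-exclusion property for all i-subsets, i = 1..t+1."""
    blocks = [frozenset(b) for b in blocks]
    if any(len(b) != t for b in blocks):
        return False
    for i in range(1, t + 2):
        for X in combinations(range(n), i):
            Xs = frozenset(X)
            # A block "covers" X when it misses exactly one element of X.
            if not any(len(Xs - b) == 1 for b in blocks):
                return False
    return True


def se_number_bruteforce(n, t):
    """Smallest size of an (n, t)-SE system, by exhaustive search."""
    candidates = [frozenset(c) for c in combinations(range(n), t)]
    for size in range(1, len(candidates) + 1):
        for blocks in combinations(candidates, size):
            if is_se_system(n, t, blocks):
                return size
    return None


print(is_se_system(4, 2, [{0, 1}, {1, 2}, {2, 3}]))  # True
print(is_se_system(4, 2, [{0, 1}, {2, 3}]))          # False: {0, 1} is not covered
print(se_number_bruteforce(4, 2))                    # 3
```

The second collection is a $(4, 3, 2)$-Turán system but not an SE system, which illustrates that the SE requirement is strictly stronger.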

Note that an $(n, t)$-SE system is a special kind of $(n, t+1, t)$-Turán
system. Note also that if we require all rows in the
parity-check matrix $H$ to be of minimum weight, then $s(H) = d$
is equivalent to the condition that the sets of column indices
corresponding to the zeros in each row form an $(n, d-2)$-SE
system. Hence,

$$\rho(C) \le S(n, d-2).$$

Therefore, any upper bound on $S(n, d-2)$ is an upper bound
on $\rho(C)$.

In [8], a number of upper bounds on $S(n,t)$ were obtained
using combinatorial constructions and were shown to
be superior to the upper bound in (1). It was shown that
$\rho(C) = T(n, d-1, d-2)$ for $1 < d \le 4$, $n > 6$, and
$\rho(C) \le T(n, d-1, d-2) + 1$ for $d = 5$. It was also shown that
$\rho(C)$ is asymptotic to $T(n, d-1, d-2)$ (and to $S(n, d-2)$) as
$n \to \infty$ for any fixed $d$, and that it is asymptotically at most
$3T(n, d-1, d-2)$ for any fixed $k = n - d + 1$.

In this paper, we build upon the work in [8] and investigate
$S(n, t)$ through a number of different approaches. New upper
bounds are obtained and analyzed. They are then used to show
that as $n \to \infty$, $\rho(C)$ is asymptotic to $S(n, d-2)$ if $d = o(\sqrt{n})$,
or if $k = o(\sqrt{n})$, $k \to \infty$.² Hence, in an asymptotic
sense, the Schwartz-Vardy conjecture is proved in these cases.
For all $d \le 5$, it is shown that $\rho(C) = S(n, d-2)$. A lower
bound on $S(n, t)$ is also derived.

²We adopt the standard "O-notation" and related asymptotic expressions
[19, Ch. 9]. Functions $f(n)$ and $g(n)$ are said to be asymptotic to each other,
denoted by $f(n) \sim g(n)$, if $\lim_{n\to\infty} f(n)/g(n) = 1$, or equivalently, if
$f(n) = (1 + o(1))g(n)$, where $o(1)$ stands for any function that goes to
zero as $n$ goes to infinity. More generally, we write $f(n) = o(g(n))$ if
$\lim_{n\to\infty} f(n)/g(n) = 0$.

Besides their application to the stopping redundancy of 
MDS codes, SE systems warrant further study for other 
reasons. As combinatorial objects, they have a very natural 
definition. Therefore, they have intrinsic mathematical appeal 
and interest. They also have practical relevance; for example, 
the definition of SE system can be readily mapped to a problem 
in the design of experiments. 

We remark, also, that some of the results on SE numbers
are rather surprising. For example, we shall see that for all
$k = o(\sqrt{n})$,

$$\frac{S(n, n-k-1)}{T(n, n-k, n-k-1)} \le (1 + o(1))\,\frac{k+1}{k}.$$

Thus, despite the apparently much stricter requirements im- 
posed upon the SE system in comparison to the Turan system, 
the increase in the total number of blocks is very small. 
Insights such as this may shed further light upon properties of 
both MDS codes and the various combinatorial constructs to 
which they are closely related. 

The rest of the paper is arranged as follows. Section II is
devoted to upper bounds on $S(n,t)$. Three approaches are
attempted: combinatorial constructions, probabilistic methods,
and recurrent inequalities. Asymptotics, especially those based
on results from recurrent inequalities, are discussed. Explicit
(calculable) upper bounds are compared, and the best bounds
are found for $n$ up to 512. In Section III we show a lower
bound on $S(n, t)$. In Section IV we comment on the Schwartz-Vardy
conjecture. We summarize what is known about the
conjecture based on the results in this paper and previous
works, and comment on our conjecture made in [18] that
$\rho(C) = S(n, d-2)$. Section V concludes the paper.

II. Upper Bounds on $S(n,t)$

We start with some preliminaries.

For any set $A$, let $[A]^i$ denote the set of all $i$-subsets of $A$.
If $A$ and $B$ are sets, we say that $A$ covers $B$ if $|B \setminus A| = 1$.
Hence, if $N$ is an $n$-set, then $S \subseteq [N]^t$ is an $(n, t)$-SE system
if and only if for each $i = 1, \ldots, t+1$ and $X \in [N]^i$, there
exists a block in $S$ that covers $X$. A covering design/Turán
system/SE system (with prescribed parameters) is minimal if
it contains the least number of blocks.

By definition, an $(n,t)$-SE system is also an $(n, t+1, t)$-Turán
system. Hence, we have

$$S(n,t) \ge T(n, t+1, t) \ge \frac{1}{n-t}\binom{n}{t+1} = \frac{1}{t+1}\binom{n}{t}, \tag{2}$$

where the second inequality [20] follows by noting that each
block in the Turán system is contained in $(n - t)$ distinct
$(t+1)$-subsets.

Let $C$ be an $[n, k = n - d + 1, d]$ linear MDS code. Recall
that the stopping redundancy of $C$ is related to SE and Turán
numbers in the following way:

$$T(n, d-1, d-2) \le \rho(C) \le S(n, d-2),$$

or, equivalently,

$$T(n, n-k, n-k-1) \le \rho(C) \le S(n, n-k-1).$$

We now focus on upper bounds on $S(n, t)$.

A. Probabilistic Bounds 

Let $N$ be an $n$-set. Consider the following random experiment
in which we build an $(n, t)$-SE system, $S \subseteq [N]^t$. In the
first step, for a prescribed real value $p \in [0, 1]$, insert into $S$
each element of $[N]^t$ with probability $p$. The expected size of
$S$ at this point is $p\binom{n}{t}$, but some $i$-subsets, $i = 1, \ldots, t+1$,
may not be covered. The probability that a given $i$-subset is
not covered equals $(1 - p)^{\psi(n,t,i)}$, where

$$\psi(n, t, i) = i\binom{n-i}{t-i+1} = i\binom{n-i}{n-t-1}.$$

So, as a second step, for each $X \in [N]^i$, $i = 1, \ldots, t+1$,
that is not yet covered, insert into $S$ some element of $[N]^t$
that covers $X$. The expected size of $S$ is then bounded from
above by

$$p\binom{n}{t} + \sum_{i=1}^{t+1}\binom{n}{i}(1-p)^{\psi(n,t,i)}.$$

This implies the following upper bound on $S(n,t)$.

Theorem 1 For all $0 \le p \le 1$,

$$S(n,t) \le p\binom{n}{t} + \sum_{i=1}^{t+1}\binom{n}{i}(1-p)^{\psi(n,t,i)}. \tag{3}$$
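The bound of Theorem 1 is straightforward to evaluate numerically. The Python sketch below computes the right-hand side of the bound for a given $p$, with the exponent taken to be $i\binom{n-i}{t-i+1}$ (the number of $t$-subsets covering a fixed $i$-subset), and minimizes it over a uniform grid of $p$ values. The function names and the grid resolution are illustrative assumptions, and a grid search only approximates the true minimum over $p$.

```python
from math import comb


def psi(n, t, i):
    """Number of t-subsets that cover a fixed i-subset: i * C(n-i, t-i+1)."""
    return i * comb(n - i, t - i + 1)


def theorem1_bound(n, t, p):
    """Right-hand side of the probabilistic bound for a given p in [0, 1]."""
    tail = sum(comb(n, i) * (1.0 - p) ** psi(n, t, i) for i in range(1, t + 2))
    return p * comb(n, t) + tail


def best_theorem1_bound(n, t, steps=2000):
    """Minimize over an evenly spaced grid of p values (resolution is arbitrary)."""
    return min(theorem1_bound(n, t, j / steps) for j in range(steps + 1))


# Example: parameters n = 31, t = d - 2 = 5.
print(best_theorem1_bound(31, 5))
```

Taking $p = 1$ recovers the trivial bound $\binom{n}{t}$, so the minimized value never exceeds it.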

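Alternatively, in the first step of the random experiment, we
may instead make $l$ random drawings from $[N]^t$. At the end
of the first step, the probability that a given $i$-subset is not
covered equals

$$\left(1 - \frac{\psi(n,t,i)}{\binom{n}{t}}\right)^{l}$$

if the drawing is done with replacement, and equals

$$\prod_{j=0}^{l-1}\left(1 - \frac{\psi(n,t,i)}{\binom{n}{t} - j}\right)^{+},$$

where $(x)^+ = \max\{x, 0\}$, if the drawing is done without
replacement. The results are the following bounds.

Theorem 2 For all $l \in \mathbb{N}$,

$$S(n,t) \le l + \sum_{i=1}^{t+1}\binom{n}{i}\left(1 - \frac{\psi(n,t,i)}{\binom{n}{t}}\right)^{l}. \tag{4}$$

Theorem 3 For all $l \in \mathbb{N}$, $l < \binom{n}{t}$,

$$S(n,t) \le l + \sum_{i=1}^{t+1}\binom{n}{i}\prod_{j=0}^{l-1}\left(1 - \frac{\psi(n,t,i)}{\binom{n}{t} - j}\right)^{+}. \tag{5}$$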

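Theorem 3 is clearly stronger than Theorem 2, and is closely
related to Theorem 1. In fact, one can show that

$$\prod_{j=0}^{l-1}\left(1 - \frac{\psi(n,t,i)}{\binom{n}{t} - j}\right)^{+} \le \left(1 - \frac{l}{\binom{n}{t}}\right)^{\psi(n,t,i)}.$$

So the upper bound (5), when minimized over $l$, is no greater
than

$$\min_{l \in \mathbb{Z},\ 0 \le l \le \binom{n}{t}}\left[\, l + \sum_{i=1}^{t+1}\binom{n}{i}\left(1 - \frac{l}{\binom{n}{t}}\right)^{\psi(n,t,i)}\right].$$

Note that we have strategically allowed $l$ to take the value $\binom{n}{t}$
in the above expression. On the other hand, letting $l = p\binom{n}{t}$
in (3), we see the minimum value of the upper bound (3) is

$$\min_{l \in \mathbb{R},\ 0 \le l \le \binom{n}{t}}\left[\, l + \sum_{i=1}^{t+1}\binom{n}{i}\left(1 - \frac{l}{\binom{n}{t}}\right)^{\psi(n,t,i)}\right].$$

Now, suppose the minimum value of the above expression
is $y$, achieved at $l = l^*$. Then its value at $l = \lceil l^* \rceil$ is less
than $y + 1$. Therefore, we conclude that the upper bound (5)
(when minimized over $l$) is less than the upper bound (3)
(when minimized over $p$) plus two. In practice, the difference
between the two bounds is very small, while the upper bound
in (3) is usually easier to compute.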

The upper bound in (3) can be written as



p+(l-p) 



t+l 



n ~ t 

m 



t 



(6) 



The first term in (6) is minimized when $p$ takes the value

$$p_{\min} = 1 - (n-t)^{-1/t},$$

in which case (6) becomes

+ (7) 

where 



r]{n,t) 



-1 t+l 

•E 

i=l 



i=l \n~t-l) 

As $n \to \infty$, it can be shown (see Appendix A) that if $t \le n - \ln n$,
then the term corresponding to $i = t + 1$ prevails in
the sum (8). Therefore,



= (1 + 0(1)) 



{n~t) 



-lit 



Plugging the above into (7), we conclude that as $n \to \infty$, the
bound (7) is



■(l + 0(n-Vt))(n), 



-i/c. 



o 



(1)) (t)- 



1 + 



•ln(n-t) \ \ In(Tt-t) 



if t = (c+ 0(1)) Inn 
if In n ^ t < n — In n 



where $c > 0$ is any constant. By the choice of $p_{\min}$, and the
fact that for $p = p_{\min}$ the $(t+1)$-st term prevails in (8), no
other values of $p$ give asymptotically tighter bounds than the
above.

³We write $f(n) \prec g(n)$ if $f(n) = o(g(n))$, and similarly, $f(n) \succ g(n)$
if $g(n) = o(f(n))$, as $n \to \infty$.

For the case when $t > n - \ln n$, a different asymptotic
analysis (see Appendix B) shows that the bound (3), when
minimized over p, is O (") ) for all t such that 2 < n — 
t = o((nlnlnn)/lnn), and is, particularly, 0(^(")) for 
all< = n-e(l). 

B. Constructive Bounds 

Our first construction for SE systems is based on a Turan 
system construction due to Kim and Roush [21]. 

Construction A Let $N$ be an $n$-set, and $t \le n - 2$ be a
positive integer. For a prescribed positive integer $l$, partition
$N$ into $l$ subsets, $N_i$, $i = 0, \ldots, l-1$, as equally as possible.
Thus, $N = \bigcup_{i=0}^{l-1} N_i$ such that $\lfloor n/l \rfloor \le |N_i| \le \lceil n/l \rceil$ for
all $i$. For all $X \subseteq N$, define the weight of $X$ as
$w(X) = \sum_{i=0}^{l-1} i\,|X \cap N_i|$. For each $j \in \{0, \ldots, l-1\}$, a subset of $[N]^t$
is chosen as

$$B_j = Z \cup \tilde{B}_j,$$

where

$$Z = \{X \in [N]^t : \exists m,\ X \cap N_m = \emptyset,\ N_{m-1} \not\subseteq X\},$$

and

$$\tilde{B}_j = \{X \in [N]^t : w(X) \equiv j \bmod l\}.$$

Note that in the above definition of $Z$, the subscript $m - 1$ is
to be interpreted as $(m - 1 \bmod l)$, and we shall stick to this
convention where applicable.

Proposition 1 For all $j$ and all $l \ge n/(n - t - 2)$, $B_j$ as
given in Construction A is an $(n,t)$-SE system.

Proof: We show that any $X \in [N]^i$, $i = 1, \ldots, t+1$, is
covered by a block in $B_j$. If $X \cap N_m = \emptyset$ for some $m$, let
$Y \in [N \setminus N_m]^{t+1}$ be selected such that $X \subseteq Y$ and $|Y \cap N_{m-1}|$
is as small as possible. Since $l \ge n/(n - t - 2)$, we have
$n - |N_m| \ge t + 2$, which ensures that $Y$ exists and that if
$N_{m-1} \not\subseteq X$ then $N_{m-1} \not\subseteq Y$. Now, choose $x \in X$ such that
if $N_{m-1} \subseteq X$ then $x \in N_{m-1}$, otherwise arbitrarily. Note that
$X$ is covered by $Y \setminus \{x\}$. But we also have $(Y \setminus \{x\}) \cap N_m = \emptyset$,
and $N_{m-1} \not\subseteq (Y \setminus \{x\})$. Therefore, $Y \setminus \{x\} \in Z$.

On the other hand, if $X \cap N_m \ne \emptyset$ for all $m$, select one
element in each such intersection, say $x_m \in X \cap N_m$. Now,
choose $Y \in [N]^{t+1}$ such that $X \subseteq Y$, and consider $Y \setminus \{x_m\}$,
$m = 0, \ldots, l-1$. All these sets cover $X$, and since
$w(Y \setminus \{x_m\}) = w(Y) - m$, they have distinct weights that span
$l$ consecutive integers, one of which must be congruent to $j$
modulo $l$. Hence, for all $j$, there exists $m$ such that
$Y \setminus \{x_m\} \in \tilde{B}_j$. ■



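Theorem 4 For all integers $l \ge n/(n - t - 2)$,

$$S(n,t) \le l\left[\binom{n - \lfloor n/l \rfloor}{t} - \binom{n - \lfloor n/l \rfloor - \lceil n/l \rceil}{t - \lceil n/l \rceil}\right] + \frac{1}{l}\binom{n}{t}.$$

Proof: From Proposition 1, for all $l \ge n/(n - t - 2)$,

$$S(n, t) \le \min_j |B_j|.$$

Note that

$$Z = \bigcup_{m=0}^{l-1}\left(Z_{1,m} \setminus Z_{2,m}\right),$$

where

$$Z_{1,m} = \{X \in [N]^t : X \cap N_m = \emptyset\},$$

$$Z_{2,m} = \{X \in [N]^t : X \cap N_m = \emptyset,\ N_{m-1} \subseteq X\}.$$

Also,

$$\bigcup_{j=0}^{l-1}\tilde{B}_j = [N]^t.$$

Thus, we have

$$\begin{aligned}
\min_j |B_j| &\le |Z| + \min_j |\tilde{B}_j| \\
&\le \sum_{m=0}^{l-1}\left[\binom{n - |N_m|}{t} - \binom{n - |N_m| - |N_{m-1}|}{t - |N_{m-1}|}\right] + \min_j |\tilde{B}_j| \\
&\le l\left[\binom{n - \lfloor n/l \rfloor}{t} - \binom{n - \lfloor n/l \rfloor - \lceil n/l \rceil}{t - \lceil n/l \rceil}\right] + \frac{1}{l}\binom{n}{t}.
\end{aligned}$$
■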

An alternative (slightly looser) form of the upper bound is 
given in the following theorem. 

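Theorem 5 For all integers $l \ge n/(n - t - 2)$,

$$S(n,t) \le \frac{1}{l}\binom{n}{t} + l\left\lceil\frac{n}{l}\right\rceil\binom{n - \lfloor n/l \rfloor - 1}{t}.$$

Proof: Note that

$$Z = \bigcup_{m=0}^{l-1}\ \bigcup_{a \in N_{m-1}} [N \setminus (N_m \cup \{a\})]^t.$$

The rest of the proof is similar to that of Theorem 4. ■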
Corollary 1 For fixed $k$, as $n \to \infty$,

$$S(n, n-k-1) \le \frac{2}{k+1}\binom{n}{k} + O(n^{k-1}).$$

Proof: Theorem 5 applies provided that $l \ge n/(k - 1)$.
If $k \ge 4$, let $l = \lfloor n/2 \rfloor$. We have

$$S(n, n-k-1) \le \frac{1}{\lfloor n/2 \rfloor}\binom{n}{n-k-1} + \frac{3n}{2}\binom{n-3}{n-k-1} = \frac{2}{k+1}\binom{n}{k} + O(n^{k-1}).$$

For $k = 3$, let $l = \lceil n/2 \rceil$. If $n$ is even, the above derivation
is still valid. If $n$ is odd, note that there is one bin that contains
a single element, and the remaining $(n-1)/2$ bins all contain two
elements. From the proof of Theorem 5, we have

$$|Z| \le \sum_{m=0}^{\lceil n/2 \rceil - 1} |N_{m-1}|\binom{n - |N_m| - 1}{n-4} = (n-2)\binom{n-3}{n-4} + 2\binom{n-2}{n-4}.$$

Hence,

$$S(n, n-4) \le \frac{1}{\lceil n/2 \rceil}\binom{n}{n-4} + O(n^2).$$

For $k < 3$, the result has already been shown in [8]. ■

Since

$$T(n, n-k, n-k-1) \ge \frac{1}{k+1}\binom{n}{k},$$

Corollary 1 also implies that for any fixed $k$,

$$S(n, n-k-1) \le (2 + o(1))\,T(n, n-k, n-k-1),$$

confirming a conjecture made in [8]. Note that the asymptotic
factor of 2 in the above inequality is sharp for $k = 1$, in which
case $S(n, n-2) = n - 1$, while $T(n, n-1, n-2) = \lceil n/2 \rceil$.
For $k > 1$, stronger results can be obtained using recurrent
inequalities, as shall be discussed in the next section.

Construction A is also a construction for Turán systems.
Indeed, it can be viewed as an improved version (i.e. one with
fewer blocks) of the Turán system construction in [21].

Proposition 2 For all $j$ and all $l$, $B_j$ as given in Construction A
is a Turán $(n, t+1, t)$-system.

Proof: The proof is similar to that of Proposition 1. ■
Our second construction for SE systems is inspired by a
construction for Turán systems due to Frankl and Rödl [22].

Construction B Let $N$ be an $n$-set, and $t < n$ be a positive
integer. Let $N_i$, $i = 0, \ldots, l-1$, and $w(X)$, $X \subseteq N$, be
defined as in Construction A. We will call $N_i$ a bin. For each
$j \in \{0, \ldots, l-1\}$, let

$$\tilde{B}_j = \{X \in [N]^t : w_j(X) \le \max\{e(X), f(X)\}\},$$

where

$$w_j(X) = (w(X) + j) \bmod l,$$

and

$$e(X) = |\{i : X \cap N_i = \emptyset\}|, \qquad f(X) = |\{i : N_i \subseteq X\}|$$

are the number of "empty" and "full" bins for $X$, respectively.
The constructed collection of $t$-subsets of $N$ is

$$B_j = F \cup \tilde{B}_j,$$

where $F$ is constructed as follows.

Fix an arbitrary total order on $N$. Let $I \subseteq \{0, \ldots, l-1\}$
be an index set that satisfies $\sum_{m \in I} |N_m| > t$, and is minimal
in the sense that all proper subsets of $I$ violate this condition.
For each such $I$ and $i, j \in I$, $i \ne j$, let $F$ include the $t$-set
that contains all elements from bins $N_m$, $m \in I \setminus \{i, j\}$, the
smallest $|N_i| - 1$ elements of $N_i$, and the smallest
$(t + 1 - \sum_{m \in I, m \ne j} |N_m|)$ elements of $N_j$.

Proposition 3 For all $j$ and all $l$, $B_j$ as given in Construction B
is an $(n,t)$-SE system.

Proof: We show that any $X \in [N]^i$, $i = 1, \ldots, t+1$,
is covered by a block in $B_j$. If $i = t + 1$, note that all $t$-subsets
of $X$ can be written as $X \setminus \{x\}$, for some $x \in X$.
Since $w(X \setminus \{x\}) = w(X) - w(\{x\})$ for all $x \in X$, by
choosing $x$ from different bins that $X$ intersects, we can make
$w(X \setminus \{x\})$ take on $l - e(X)$ different values. Since no two of
these values differ by more than $l - 1$, this also means that we
can realize $l - e(X)$ different values for $w_j(X \setminus \{x\})$. Since
only $l - e(X) - 1$ numbers in $\{0, \ldots, l-1\}$ are greater than
$e(X)$, there exists $x \in X$ such that $w_j(X \setminus \{x\}) \le e(X) \le e(X \setminus \{x\})$,
hence $X \setminus \{x\} \in \tilde{B}_j$, and it covers $X$.

If $i \le t$, consider two cases. First, let us assume that there
exists $m$, such that $X \cap N_m \ne \emptyset$ and $N_m \not\subseteq X$. In this case,
remove from $X$ an arbitrary element in $X \cap N_m$, add in $t - i$
other elements from $N$ using as few elements from $N_m$ as
possible, and call the resulting $(t-1)$-set $\hat{X}$. That is,
$\hat{X} = (X \setminus \{x\}) \cup Y$, for some $x \in X \cap N_m$ and some $Y \in [N \setminus X]^{t-i}$
that has a minimal number of elements from $N_m$. Note that
the choice of $\hat{X}$ ensures that $f(\hat{X} \cup \{x\}) = f(\hat{X})$. Since
$w(\hat{X} \cup \{z\}) = w(\hat{X}) + w(\{z\})$ for all $z \notin \hat{X}$, by choosing
$z \notin \hat{X}$, $z \ne x$, from different bins where possible, we can
make $w(\hat{X} \cup \{z\})$ take on $l - f(\hat{X} \cup \{x\}) = l - f(\hat{X})$ different
values. This also means that we can realize $l - f(\hat{X})$ different
values for $w_j(\hat{X} \cup \{z\})$. Since only $l - f(\hat{X}) - 1$ numbers in
$\{0, \ldots, l-1\}$ are greater than $f(\hat{X})$, there exists $z$ such that
$w_j(\hat{X} \cup \{z\}) \le f(\hat{X}) \le f(\hat{X} \cup \{z\})$, hence $\hat{X} \cup \{z\} \in \tilde{B}_j$,
and it covers $X$.

Next, if no $m$ exists such that $X \cap N_m \ne \emptyset$ and $N_m \not\subseteq X$,
this means that for all $m$ such that $X \cap N_m \ne \emptyset$, we have
$N_m \subseteq X$. Figuratively, it means that $X$ consists of a number
of full bins. Let $N_i$ be any bin that $X$ intersects. Let $x \in N_i$
be its largest element. Take $X \setminus \{x\}$, and add to it elements
from bins that $X$ does not intersect, one bin after another,
from the smallest to the largest within each bin, until $X \setminus \{x\}$ is
augmented to contain $t$ elements. By construction, the $t$-subset
thus obtained is contained in $F$. ■

Theorem 6 For all positive integers I, 



S{n,t) < 
where 




Proof: Note that 

E /w= E Ei^c^} 
"=E( E 

i xe[NY 



Similarly, 

xe[jv]t ^ 
Each $X \in [N]^t$ is contained in precisely $1 + \max\{e(X), f(X)\}$ of the sets
$\{X \in [N]^t : w_j(X) \le \max\{e(X), f(X)\}\}$, $j = 0, \ldots, l-1$. Therefore,

E (l+niax{e(X),/(X)}) 

xe[N]t 



< 



E {e{X) + f{X)) 

X<£[N]t 



Hence, 



1 /n 



min|Bj| < , , , 
J I \ t 



^-LfJ 



Finally, note that for each valid $I$, $F$ contains $|I|(|I| - 1)$
$t$-subsets, and a valid $I$ must satisfy $|I|\lceil n/l \rceil \ge t + 1$ and
$(|I| - 1)\lfloor n/l \rfloor < t + 1$. Therefore, $|F| \le g(n,t,l)$. ■

C. Recurrent Bounds 

We observe that an $(n,t)$-SE system can be constructed
from an $(n-1, t-1)$-SE system and an $(n-1, t+1, t)$-Turán
system, as shown in the following lemma.

Lemma 1 For all $0 < t < n - 1$,

$$S(n, t) \le S(n-1, t-1) + T(n-1, t+1, t),$$

or equivalently, for all $0 < k < n - 1$,

$$S(n, n-k-1) \le S(n-1, n-k-2) + C(n-1, k, k-1).$$

Proof: Let $N$ be an $n$-set and $a \in N$ be an arbitrary
element. Let $S \subseteq [N \setminus \{a\}]^{t-1}$ be a minimal $(n-1, t-1)$-SE
system, and $T \subseteq [N \setminus \{a\}]^t$ be a minimal $(n-1, t+1, t)$-Turán
system. Define $S' = \{s \cup \{a\} : s \in S\}$. Then $S' \cup T$ is an
$(n, t)$-SE system. Indeed, for all $X \in [N]^i$, $i = 1, \ldots, t+1$,
if $1 \le |X \setminus \{a\}| \le t$, then there exists $s \in S$ such that
$|(X \setminus \{a\}) \setminus s| = 1$, which implies that $|X \setminus (s \cup \{a\})| = 1$,
i.e. $X$ is covered by a block in $S'$. The only cases left are
when $X = \{a\}$, and when $X \in [N \setminus \{a\}]^{t+1}$. In either case,
$X$ is covered by a block in $T$. ■
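The construction in the proof of Lemma 1 can be reproduced on a toy example. The Python sketch below lifts a $(3,1)$-SE system on $\{0,1,2\}$ by the new element $a = 3$, adds a $(3,3,2)$-Turán system on $\{0,1,2\}$, and checks that the union is a $(4,2)$-SE system. The helper names are hypothetical and the small input systems were chosen by hand for illustration.

```python
from itertools import combinations


def is_se_system(ground, t, blocks):
    """Single-exclusion check over an explicit ground set."""
    ground = sorted(ground)
    blocks = [frozenset(b) for b in blocks]
    if any(len(b) != t for b in blocks):
        return False
    for i in range(1, t + 2):
        for X in combinations(ground, i):
            Xs = frozenset(X)
            if not any(len(Xs - b) == 1 for b in blocks):
                return False
    return True


def is_turan_system(ground, s, t, blocks):
    """Each s-subset of the ground set must contain a t-subset block."""
    blocks = [frozenset(b) for b in blocks]
    if any(len(b) != t for b in blocks):
        return False
    return all(
        any(b <= frozenset(X) for b in blocks)
        for X in combinations(sorted(ground), s)
    )


def combine_lemma1(a, se_blocks, turan_blocks):
    """Form S' ∪ T, where S' adds the element `a` to every block of S."""
    lifted = [frozenset(s) | {a} for s in se_blocks]
    return lifted + [frozenset(b) for b in turan_blocks]


small_ground = {0, 1, 2}
S_small = [{0}, {1}]    # a (3, 1)-SE system on {0, 1, 2}
T_small = [{0, 1}]      # a (3, 3, 2)-Turan system on {0, 1, 2}
combined = combine_lemma1(3, S_small, T_small)
print(is_se_system({0, 1, 2, 3}, 2, combined), len(combined))  # True 3
```

Here the combined system has three blocks, which matches $S(4, 2) = n - 1 = 3$.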



Theorem 7 For all $0 < t < n - 1$,

$$S(n, t) \le \sum_{i=0}^{t} T(n - t + i - 1, i + 1, i), \tag{9}$$

or equivalently, for all $0 < k < n - 1$,

$$S(n, n-k-1) \le \sum_{i=k}^{n-1} C(i, k, k-1). \tag{10}$$

Proof: Recursively apply Lemma 1. ■

Interesting results follow. When $k = 1$, (10) implies that
$S(n, n-2) \le n - 1$, which is sharp. When $k = 2$, since
$C(i, 2, 1) = \lceil i/2 \rceil$, (10) gives

$$S(n, n-3) \le \left\lfloor\frac{n}{2}\right\rfloor\left\lceil\frac{n}{2}\right\rceil - 1,$$

which is asymptotically tighter than Corollary 1.

Generally, since exact values of most Turán/covering
numbers are not known, the upper bounds in Theorem 7 often
cannot be directly evaluated. To get a computable upper bound,
one can simply replace each Turán/covering number in the
sum by an explicit upper bound. We show several ways to do
this. The first one is based on a result by Erdős and Spencer
[23].

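Theorem 8 For all $0 < k < n - 1$,

$$S(n, n-k-1) \le \frac{1 + \ln k}{k}\left[\binom{n}{k} - 1\right].$$

Proof: In [23], it was shown that for all $n \ge s \ge t$,

$$C(n, s, t) \le \frac{\binom{n}{t}}{\binom{s}{t}}\left[1 + \ln\binom{s}{t}\right]. \tag{11}$$

Plugging (11) into (10), we obtain the claimed result after
some algebraic manipulations. ■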

The second one is based on an upper bound on Turan 
numbers due to Sidorenko [16, Construction 4]. 

Theorem 9 For all $0 < t < n - 1$ and positive integers
$l_0, l_1, \ldots, l_t$,



S{n,t) < E 



i=0 



where 




771 + i 

i 



Proof: Omitted. ■ 
A third way to obtain an explicit upper bound from Theorem 7
is based on a construction of an $(n, k, k-1)$ covering
design due to Kuzjurin [24], although we count blocks in a
slightly different manner.

Lemma 2 For all positive integers $n > k$,

$$C(n, k, k-1) \le \frac{1}{k}\binom{n}{k-1} + \frac{k-1}{k}\binom{n-1}{k-2}.$$

Proof: Let $N$ be an $n$-set. WLOG, let $N = \{1, 2, \ldots, n\}$.
Let $Q_i = \{X \in [N]^k : \sum_{x \in X} x \equiv i \bmod n\}$, and
$C_i = \{X \in [N]^{k-1} : \nexists\, Y \in Q_i \text{ s.t. } X \subseteq Y\}$, $i = 0, 1, \ldots, n-1$.
For each $X \in C_i$, we can add one block $Y \in [N]^k$ to $Q_i$, such
that $X \subseteq Y$. Hence, by adding no more than $|C_i|$ blocks to
$Q_i$, we construct an $(n, k, k-1)$ covering design. Therefore,
for all $i$,

$$C(n, k, k-1) \le |Q_i| + |C_i|.$$

Now note that for all $Y, Z \in Q_i$, $Y \ne Z$, we have $|Y \cap Z| \le k - 2$.
Therefore, for all $X \in [N]^{k-1}$ there is at most one
block $Y$ in $Q_i$ such that $X \subseteq Y$; on the other hand, for every
$Y \in [N]^k$ there are $k$ elements $X \in [N]^{k-1}$ such that $X \subseteq Y$.
Hence,

$$|C_i| = \binom{n}{k-1} - k|Q_i|.$$

We have

$$\sum_{i=0}^{n-1}\left(|Q_i| + |C_i|\right) = \sum_{i=0}^{n-1}\left[\binom{n}{k-1} - (k-1)|Q_i|\right] = n\binom{n}{k-1} - (k-1)\binom{n}{k}.$$

Therefore, there exists $i$, such that

$$|C_i| + |Q_i| \le \binom{n}{k-1} - \frac{k-1}{n}\binom{n}{k} = \frac{1}{k}\binom{n}{k-1} + \frac{k-1}{k}\binom{n-1}{k-2}.$$
■
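The counting argument in the proof of Lemma 2 can be replayed on a small case. The Python sketch below builds, for each residue $i$, the family of $k$-subsets of $\{1, \ldots, n\}$ whose element sum is congruent to $i$ modulo $n$, adds one extra block for every $(k-1)$-subset left uncovered, and compares the smallest resulting design with the bound $\frac{1}{k}\binom{n}{k-1} + \frac{k-1}{k}\binom{n-1}{k-2}$. The function names are hypothetical, and the extra block attached to each uncovered subset is chosen arbitrarily (the first available element), which is one of many valid choices.

```python
from fractions import Fraction
from itertools import combinations
from math import comb


def residue_cover(n, k, i):
    """Covering design built from the sum-congruent-to-i family plus patch blocks."""
    ground = range(1, n + 1)
    Q = [frozenset(X) for X in combinations(ground, k) if sum(X) % n == i]
    covered = {frozenset(X) for Y in Q for X in combinations(sorted(Y), k - 1)}
    extra = []
    for X in combinations(ground, k - 1):
        Xs = frozenset(X)
        if Xs not in covered:
            # Patch: extend the uncovered (k-1)-subset by any outside element.
            y = next(e for e in ground if e not in Xs)
            extra.append(Xs | {y})
    return Q + extra


def is_covering_design(n, k, blocks):
    """Each (k-1)-subset of {1..n} must lie inside some k-subset block."""
    return all(
        any(frozenset(X) <= b for b in blocks)
        for X in combinations(range(1, n + 1), k - 1)
    )


def lemma2_bound(n, k):
    """The bound (1/k) C(n, k-1) + ((k-1)/k) C(n-1, k-2), kept exact."""
    return Fraction(comb(n, k - 1), k) + Fraction((k - 1) * comb(n - 1, k - 2), k)


designs = [residue_cover(7, 3, i) for i in range(7)]
sizes = [len(d) for d in designs]
print(min(sizes), lemma2_bound(7, 3))
```

By the averaging step in the proof, the smallest of the $n$ designs cannot exceed the bound, which the sketch lets one confirm for $(n, k) = (7, 3)$.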



Plugging the bound in the preceding lemma into (10), we
obtain the following theorem.

Theorem 10 For all $0 < k < n - 1$,

$$S(n, n-k-1) \le \frac{1}{k}\binom{n}{k} + \frac{k-1}{k}\binom{n-1}{k-1} - 1.$$

Proof: Omitted. ■
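The step from Lemma 2 to Theorem 10 is a pair of "hockey-stick" summations, and it can be checked numerically. The Python sketch below assumes the Lemma 2 bound in the form $\frac{1}{k}\binom{m}{k-1} + \frac{k-1}{k}\binom{m-1}{k-2}$, sums it over $m = k, \ldots, n-1$ as in (10), and compares the result with the closed form $\frac{1}{k}\binom{n}{k} + \frac{k-1}{k}\binom{n-1}{k-1} - 1$. The function names are illustrative, and the closed form is the one obtained by carrying out the summation here, stated as an assumption about the theorem.

```python
from fractions import Fraction
from math import comb, floor


def lemma2_bound(m, k):
    """Assumed Lemma 2 bound on C(m, k, k-1), kept as an exact fraction."""
    return Fraction(comb(m, k - 1), k) + Fraction((k - 1) * comb(m - 1, k - 2), k)


def recurrent_sum(n, k):
    """Plug the Lemma 2 bound into the sum over i = k..n-1."""
    return sum(lemma2_bound(i, k) for i in range(k, n))


def closed_form(n, k):
    """Closed form obtained from the two hockey-stick identities."""
    return Fraction(comb(n, k), k) + Fraction((k - 1) * comb(n - 1, k - 1), k) - 1


# (n, d) = (31, 27) gives k = n - d + 1 = 5; (n, d) = (31, 7) gives k = 25.
for n, k in [(31, 5), (31, 25)]:
    print(n, k, floor(closed_form(n, k)), recurrent_sum(n, k) == closed_form(n, k))
```

For $(n, d) = (31, 27)$ and $(31, 7)$ the floored values are 55,905 and 599,474, which agree with the "Recurrent C" entries reported in Table I.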

Corollary 2 For all $0 < k = o(\sqrt{n})$, as $n \to \infty$,

$$S(n, n-k-1) \le (1 + o(1))\,\frac{1}{k}\binom{n}{k}.$$

In particular, if in addition $k \to \infty$, then

$$S(n, n-k-1) \sim T(n, n-k, n-k-1).$$

Proof: Omitted. ■
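Interestingly, a similar asymptotic result can be shown for
$S(n, t)$ when $t$ is relatively small. First, we note the following
theorem relating SE and Turán numbers.

Theorem 11 For all $0 < t < n - 1$,

$$S(n,t) \le \left(1 - \frac{t}{n}\right)T(n, t+1, t) + \binom{n-1}{t-1}.$$

Proof: We will use the fact [25] that

$$T(n, s, t) \ge \frac{n}{n-t}\,T(n-1, s, t).$$

From Theorem 7, we have

$$\begin{aligned}
S(n, t) &\le T(n-1, t+1, t) + \sum_{i=0}^{t-1} T(n - t + i - 1, i + 1, i) \\
&\le \left(1 - \frac{t}{n}\right)T(n, t+1, t) + \sum_{i=0}^{t-1}\binom{n - t + i - 1}{i} \\
&= \left(1 - \frac{t}{n}\right)T(n, t+1, t) + \sum_{i=n-t-1}^{n-2}\binom{i}{n-t-1} \\
&= \left(1 - \frac{t}{n}\right)T(n, t+1, t) + \binom{n-1}{t-1}.
\end{aligned}$$
■

In [8, Theorem 21], it was shown that $S(n, t)$ is asymptotic
to $T(n, t+1, t)$ for any fixed $t$ as $n \to \infty$. Theorem 11 enables
us to extend this result to all $t = o(\sqrt{n})$.

Fig. 1. Best upper bounds on $S(n, d-2)$ (hence on $\rho(C)$), for $5 < d < n \le 512$.

Corollary 3 For all $t = o(\sqrt{n})$, as $n \to \infty$,

$$S(n,t) \sim T(n, t+1, t).$$

Proof: Note that, by (2), $T(n, t+1, t) \ge \frac{1}{t+1}\binom{n}{t}$, so that

$$\binom{n-1}{t-1} = \frac{t}{n}\binom{n}{t} \le \frac{t(t+1)}{n}\,T(n, t+1, t).$$

The result then follows immediately from Theorem 11. ■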



D. Comparison of Upper Bounds 

We numerically computed several of the upper bounds on
$S(n, d-2)$, and hence on $\rho(C)$, for all $5 < d < n \le 512$.
For each $(n, d)$ pair, the tightest bound is identified, and the
results are shown in Fig. 1. In the figure, "Construction A"
refers to Theorem 4, "Construction B" refers to Theorem 6,
"Probabilistic" refers to Theorem 1, "Recurrent B" refers
to Theorem 9, and "Recurrent C" refers to Theorem 10.
Other bounds included in the comparison (but not appearing
in the figures) are "Schwartz-Vardy" (1), "Construction 1"
([8, Theorem 27]), "Construction 3" ([8, Theorem 39]), and
"Recurrent A" (Theorem 8). Note that $d \le 5$ is not considered,
since in this case $S(n, d-2)$ (and $\rho(C)$) is known to be at
most $T(n, d-1, d-2) + 1$, for which either precise formulas
are known, or tighter special upper bounds exist [8].

We observe that the most "successful" bounds are Recurrent
B, Recurrent C, Probabilistic, and Construction A. A minor
exception is Construction B, which excels occasionally for
certain small values of $(n, d)$. As $n$ gets larger, a trend can be
seen. Roughly speaking, for code rate $(n - d + 1)/n > 1/5$,
Recurrent B is the best bound. For code rate less than $1/5$, the
best bounds are Probabilistic, Construction A, and Recurrent
C, in that respective order, as code rate gets progressively
smaller.

A few samples of the upper bounds are given in Table I.
The tightest are marked with an asterisk. For comparison, a
lower bound on $T(n, d-1, d-2)$ (hence on $S(n, d-2)$
and $\rho(C)$ as well) has been included, based on (12) (see
Section III). Compared to upper bounds previously known,
significant improvements can clearly be seen. As a side note,
we caution that while Recurrent C is an excellent bound when
$d$ is very close to $n$, it gets loose quickly as $d$ gets smaller,
and should be avoided if the code rate is greater than 1/2.

TABLE I
Upper bounds on $S(n, d-2)$ and $\rho(C)$

(n, d)                 (31, 7)      (31, 23)      (31, 27)
Probabilistic           96,112     6,412,596*       77,298
Construction A          93,691     7,693,683        86,148
Construction B          76,986    12,151,903       299,697
Recurrent A            124,250     7,161,809        88,673
Recurrent B             71,891*    9,665,343       520,847
Recurrent C            599,474     7,442,607        55,905*
Construction 1 [8]      93,691     7,786,707       106,388
Construction 3 [8]      76,986    16,275,110       269,970
Schwartz-Vardy (1)     142,506    31,475,730       617,526
Lower Bound (12)        33,981     2,103,660        29,450

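III. Lower Bounds on $S(n, t)$

A few other lower bounds on Turán/covering numbers are
known, besides the simple lower bound in (2). For example,
Schönheim [25] showed that

$$T(n, t+1, t) \ge \left\lceil\frac{n}{n-t}\left\lceil\frac{n-1}{n-t-1}\left\lceil\cdots\left\lceil\frac{t+2}{2}\right\rceil\cdots\right\rceil\right\rceil\right\rceil. \tag{12}$$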
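The Schönheim bound is an iterated ceiling and is easy to evaluate. The Python sketch below starts from $T(t+1, t+1, t) = 1$ and repeatedly applies $T(m, t+1, t) \ge \lceil \frac{m}{m-t}\,T(m-1, t+1, t) \rceil$ for $m = t+2, \ldots, n$, using integer arithmetic for the ceilings. The function name is an illustrative assumption.

```python
def schonheim_turan_lower_bound(n, t):
    """Iterated-ceiling lower bound on T(n, t+1, t)."""
    bound = 1  # T(t+1, t+1, t) = 1: the single (t+1)-subset needs one block
    for m in range(t + 2, n + 1):
        # Integer ceiling of m * bound / (m - t).
        bound = -((-m * bound) // (m - t))
    return bound


# (n, d) = (31, 7) corresponds to t = d - 2 = 5; (31, 27) to t = 25.
print(schonheim_turan_lower_bound(31, 5))   # 33981
print(schonheim_turan_lower_bound(31, 25))  # 29450
```

These two values coincide with the "Lower Bound" entries of Table I for $(n, d) = (31, 7)$ and $(31, 27)$.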



Another useful bound is due to De Caen [26]:

$$T(n, t+1, t) \ge \frac{1}{t}\cdot\frac{n-t}{n-t+1}\binom{n}{t}.$$

Note that a lower bound on $T(n, d-1, d-2)$ is in turn a
lower bound on $\rho(C)$ and on $S(n, d-2)$.

In this section, we study a lower bound result on $S(n, t)$
(which is not a lower bound on $T(n, t+1, t)$ in general). Note
that a lower bound (just) on $S(n, d-2)$ is not necessarily a
lower bound on $\rho(C)$.



Theorem 12 For all <t < n 

n — t 



1, 



1 



S{n,t) 



n{n-t- 1/2)^ 

>T{n-l,t+l,t) + 



1 



t-l/2\ t 

Proof: Let N be an n-set, and S C [N]* be a minimal 
(rt, i)-SE system. For each j E N, S can be partitioned into 
blocks that contain j and those that do not, namely. 



S 



where Aj = {X E S : j E X}, and B. 



Note that for all j, Bj is an (n 



{X e 5 : J ^ X}. 
l,t + l,i)-Turan system. 
A E Aj}, then each t-set 



Further, if we let Aj = {A \ {j} 

in [N \ \ Bj contains at least one element of A'j. To see 
this, suppose X E [N\{j}Y\Bj. Then XU{j} is a (<+l)-set 
and so there exists Y E S such that Y C [X U {j}). Since 
X ^ 5, we have j eY and hence A'^ 3 iY\{j}) C X. 



On the other hand, since each element of A'^ is contained 



in 



(n- 1) - (i - 1) = n-t 



i-subsets of iV\{j}, it is contained in at most (n — t) distinct 
i-sets in [N \ {j}]* \ Bj. Therefore, 



n — t 



n — 1 
t 



- \B., 



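This lower estimate can be improved by a more careful
argument as follows. Let

$$\begin{aligned}
\mathcal{A}'_{j1} &= \{A \in \mathcal{A}'_j : \exists B \in \mathcal{B}_j,\ A \subset B\},\\
\mathcal{A}'_{j2} &= \{A \in \mathcal{A}'_j \setminus \mathcal{A}'_{j1} : \exists A' \in \mathcal{A}'_{j1},\ |A \setminus A'| = 1\},\\
\mathcal{A}'_{j3} &= \Big\{A \in \mathcal{A}'_j \setminus \bigcup_{l=1}^{2} \mathcal{A}'_{jl} : \exists A' \in \mathcal{A}'_{j2},\ |A \setminus A'| = 1\Big\},\\
&\;\;\vdots\\
\mathcal{A}'_{ji} &= \Big\{A \in \mathcal{A}'_j \setminus \bigcup_{l=1}^{i-1} \mathcal{A}'_{jl} : \exists A' \in \mathcal{A}'_{j,i-1},\ |A \setminus A'| = 1\Big\},\\
&\;\;\vdots
\end{aligned}$$

Note that $\mathcal{A}'_{ji} \cap \mathcal{A}'_{jl} = \emptyset$ for all $i \ne l$. Since $\mathcal{A}'_j$ is finite, there
exists $i$ such that $\mathcal{A}'_{jl} = \emptyset$ for all $l > i$. Regardless, define

$$\bar{\mathcal{A}}'_j = \bigcup_{l=1}^{\infty} \mathcal{A}'_{jl}.$$

We claim that on average, an element in $\bar{\mathcal{A}}'_j$ is contained in at
most $(n-t-1)$ $t$-sets in $[N \setminus \{j\}]^t \setminus \mathcal{B}_j$. To see this, consider
a process in which we enumerate elements of $\bar{\mathcal{A}}'_j$, and for each
element, "mark" the $t$-sets in $[N \setminus \{j\}]^t \setminus \mathcal{B}_j$ that contain it.
We start with elements in $\mathcal{A}'_{j1}$ and proceed to $\mathcal{A}'_{j2}, \mathcal{A}'_{j3}$, and
so on. Each element in $\mathcal{A}'_{j1}$ is contained in $(n-t)$ $t$-sets in
$[N \setminus \{j\}]^t$, at least one of which lies in $\mathcal{B}_j$. Therefore, for each
element in $\mathcal{A}'_{j1}$, at most $(n-t-1)$ $t$-sets are marked. Now,
for any $X \in \mathcal{A}'_{j2}$, by definition, there exists $Y \in \mathcal{A}'_{j1}$, such
that $|X \setminus Y| = 1$. Hence, among the $(n-t)$ $t$-sets that contain
$X$, at least one of them, namely, $X \cup Y$, is already marked
(note that $X \cup Y \notin \mathcal{B}_j$, since otherwise $X$ would belong to $\mathcal{A}'_{j1}$).
Therefore, processing any $X \in \mathcal{A}'_{j2}$ marks at most $(n-t-1)$
additional $t$-sets in $[N \setminus \{j\}]^t \setminus \mathcal{B}_j$. A similar argument shows
that among the $(n-t)$ $t$-sets that contain an element of $\mathcal{A}'_{ji}$, $i \ge 3$, at
least one of them is already marked after elements of $\mathcal{A}'_{j,i-1}$
have been processed.

For $\mathcal{A}'_j \setminus \bar{\mathcal{A}}'_j$, we show that on average, each element marks
at most $(n-t-1/2)$ $t$-sets in $[N \setminus \{j\}]^t \setminus \mathcal{B}_j$. Let $X \in
\mathcal{A}'_j \setminus \bar{\mathcal{A}}'_j$. As $X \cup \{j\} \in [N]^t$, there exists $Y \in \mathcal{S}$, such that
$|(X \cup \{j\}) \setminus Y| = 1$. Since $X \notin \mathcal{A}'_{j1}$, we have $j \in Y$ and hence
$Y \setminus \{j\} \in \mathcal{A}'_j$. In fact, $Y \setminus \{j\} \in \mathcal{A}'_j \setminus \bar{\mathcal{A}}'_j$, since otherwise
it would imply that $X \in \bar{\mathcal{A}}'_j$. Now, let $Z = X \cup Y \setminus \{j\}$,
and denote by $l$ the number of elements of $\mathcal{A}'_j \setminus \bar{\mathcal{A}}'_j$ that are
contained in $Z$. Note that $l \ge 2$, since $Z$ contains both $X$ and
$Y \setminus \{j\}$. Therefore, of the $l$ elements that are contained in $Z$,
each on average marks

$$n - t - (l-1)/l \le n - t - 1/2$$

$t$-sets in $[N \setminus \{j\}]^t \setminus \mathcal{B}_j$. For other elements in $\mathcal{A}'_j \setminus \bar{\mathcal{A}}'_j$, the
above argument can be repeated until all elements have been
considered.

Based on the preceding discussions, we conclude that on
average, each element of $\mathcal{A}'_j$ is contained in no more than
$(n-t-1/2)$ $t$-sets in $[N \setminus \{j\}]^t \setminus \mathcal{B}_j$. Hence,

$$|\mathcal{S}| = |\mathcal{A}'_j| + |\mathcal{B}_j| \ge \frac{1}{n-t-1/2} \left( \binom{n-1}{t} - |\mathcal{B}_j| \right) + T(n-1, t+1, t). \qquad (13)$$

Since each block of $\mathcal{S}$ appears in $(n-t)$ of the $\mathcal{B}_j$'s, we have

$$\sum_{j \in N} |\mathcal{B}_j| = (n-t) |\mathcal{S}|.$$

Summing (13) over all $j$, dividing both sides by $n$, and noting
that $|\mathcal{S}| = S(n,t)$ (since $\mathcal{S}$ was chosen to be minimal) gives
the desired inequality. ■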
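The partition used in the proof can be illustrated on a small instance. The sketch below takes a hand-picked $(5,3)$-SE system (an example chosen for this illustration, not one given in the paper), forms $\mathcal{B}_j$ and $\mathcal{A}'_j$ for every $j$, and checks the three facts the proof relies on: $\mathcal{B}_j$ is an $(n-1,t+1,t)$-Turán system, every $t$-set of $N \setminus \{j\}$ outside $\mathcal{B}_j$ contains an element of $\mathcal{A}'_j$, and $\sum_j |\mathcal{B}_j| = (n-t)|\mathcal{S}|$. All function and variable names are hypothetical.

```python
from itertools import combinations

n, t = 5, 3
# Hand-picked (5, 3)-SE system on {1, ..., 5}; an illustrative assumption.
S = [frozenset(b) for b in [(1, 2, 3), (1, 2, 4), (3, 4, 5), (1, 3, 5)]]


def is_se_system(blocks, n, t):
    """Each i-subset, i = 1..t+1, has a block missing exactly one of its elements."""
    ground = range(1, n + 1)
    return all(
        any(len(set(x) - b) == 1 for b in blocks)
        for i in range(1, t + 2)
        for x in combinations(ground, i)
    )


def check_partition(blocks, n, t, j):
    """Return (B_j is Turan, covering claim holds, |A'_j|, |B_j|) for a given j."""
    rest = [v for v in range(1, n + 1) if v != j]
    B_j = [b for b in blocks if j not in b]
    A_prime = [b - {j} for b in blocks if j in b]
    # Every (t+1)-subset of N \ {j} must contain a block of B_j.
    turan = all(any(b <= set(y) for b in B_j) for y in combinations(rest, t + 1))
    # Every t-subset of N \ {j} that is not in B_j must contain an element of A'_j.
    covered = all(
        any(a <= set(x) for a in A_prime)
        for x in combinations(rest, t)
        if frozenset(x) not in B_j
    )
    return turan, covered, len(A_prime), len(B_j)


results = {j: check_partition(S, n, t, j) for j in range(1, n + 1)}
print(is_se_system(S, n, t))
for j, row in results.items():
    print(j, row)
print(sum(row[3] for row in results.values()) == (n - t) * len(S))
```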

Corollary 4 For all $0 < t < n - 1$, we have

$$S(n,t) \ge \frac{1}{n - t - (t/n) + 1/2} \binom{n}{t+1}.$$

Proof: Simply use the facts that

$$T(n-1, t+1, t) \ge \frac{1}{n-t-1} \binom{n-1}{t+1},$$

and

$$\binom{n}{t+1} = \binom{n-1}{t+1} + \binom{n-1}{t}.$$ ■

Equivalently, if we let $k = n - t - 1$, then we have that for
all $0 < k < n - 1$,

$$S(n, n-k-1) \ge \frac{1}{k + (k+1)/n + 1/2} \binom{n}{k}.$$

To relate this to the asymptotic results shown earlier, we 
note the following corollary. 

Corollary 5 For all $k > 0$, as $n \to \infty$,

$$S(n, n-k-1) \ge (1 - o(1)) \frac{1}{k + 1/2} \binom{n}{k}.$$

Proof: Trivial. ■

For fixed $k$, $T(n, n-k, n-k-1)$ is asymptotic to $\frac{1}{k+1} \binom{n}{k}$
(cf. [27]). So the above corollary shows that for any fixed $k$,
the ratio $S(n, n-k-1) / T(n, n-k, n-k-1)$ is bounded
away from 1 as $n \to \infty$.
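To make the gap concrete, the following sketch evaluates the lower bound of Corollary 4, in the form $\binom{n}{k} / (k + (k+1)/n + 1/2)$, against the quantity $\binom{n}{k}/(k+1)$, taken here as the asymptotic value of the Turán number for fixed $k$ (an assumption of this illustration). Their ratio tends to $(k+1)/(k+1/2) > 1$. The function names are illustrative.

```python
from fractions import Fraction
from math import comb


def se_lower_bound(n: int, k: int) -> Fraction:
    """Lower bound on S(n, n-k-1): C(n, k) / (k + (k+1)/n + 1/2)."""
    return comb(n, k) / (k + Fraction(k + 1, n) + Fraction(1, 2))


def turan_asymptote(n: int, k: int) -> Fraction:
    """C(n, k) / (k + 1), assumed asymptotic value of T(n, n-k, n-k-1)."""
    return Fraction(comb(n, k), k + 1)


def ratio(n: int, k: int) -> Fraction:
    """Ratio of the SE lower bound to the assumed Turan asymptote."""
    return se_lower_bound(n, k) / turan_asymptote(n, k)


for k in (1, 2, 3):
    limit = Fraction(k + 1) / (k + Fraction(1, 2))
    print(k, float(ratio(100, k)), float(ratio(10**6, k)), float(limit))
```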

IV. On the Schwartz-Vardy Conjecture

Schwartz and Vardy [6] conjectured that the stopping
redundancy of an MDS code only depends on its length and
minimum distance.

In [8], we showed that for given $1 < d \le 5$ and $n > d$,
the leeway in $\rho(C)$ for any $[n, n-d+1, d]$ MDS code $C$ is
at most one. We can now close the gap completely and show
that the Schwartz-Vardy conjecture is true for all MDS codes
with $1 < d \le 5$, using the recurrent bounds of Theorem 7.

Theorem 13 For all $n \ge 6$,

$$S(n,3) = T(n,4,3).$$

Proof: Let $\mathcal{T}$ be a minimal Turán $(n,4,3)$-system, $n \ge 6$.
We show that $\mathcal{T}$ must also be an $(n,3)$-SE system. Since $\mathcal{T}$ is
a Turán $(n,4,3)$-system, all 4-sets are covered. In [8], it was
shown that all 1-sets and 2-sets are covered too. It remains
to show that all 3-sets are covered. Suppose, to the contrary,
that some 3-set, $X$, is not covered. It was shown in [8] that at
least $1 + 2\binom{n-3}{2}$ blocks in $\mathcal{T}$ contain elements from $X$, and
those that do not form a Turán $(n-3,4,3)$-system. Hence,

$$S(n,3) \ge T(n,4,3) = |\mathcal{T}| \ge T(n-3,4,3) + 2\binom{n-3}{2} + 1.$$

However, from Theorem 7,

$$S(n,3) \le T(n-1,4,3) + T(n-2,3,2) + T(n-3,2,1) + 1.$$

And since (cf. [16])

$$T(n,s,t) \le T(n-1,s,t) + T(n-1,s-1,t-1),$$

we have

$$T(n-1,4,3) \le T(n-3,4,3) + T(n-3,3,2) + T(n-2,3,2).$$

Together with $T(n-3,2,1) = n-4$, these imply that

$$S(n,3) \le T(n-3,4,3) + 2T(n-2,3,2) + T(n-3,3,2) + n - 3.$$

Putting the upper and lower bounds on $S(n,3)$ together, we
have

$$2T(n-2,3,2) + T(n-3,3,2) + n - 3 \ge 2\binom{n-3}{2} + 1.$$

However, since it is known (cf. [28], [29]) that $T(n,3,2) =
\lfloor n/2 \rfloor (\lceil n/2 \rceil - 1)$, the above inequality results in a
contradiction for all $n \ge 9$. For $n = 6, 7, 8$, the theorem can be verified
directly, and was also proved in [8] (for $6 \le n \le 53$) using a
different argument. ■
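The final step of the proof is a purely numerical comparison, which the sketch below tabulates using the closed form $T(n,3,2) = \lfloor n/2 \rfloor (\lceil n/2 \rceil - 1)$. It evaluates both sides of the inequality $2T(n-2,3,2) + T(n-3,3,2) + n - 3 \ge 2\binom{n-3}{2} + 1$ and reports where it fails. The helper names are illustrative.

```python
from math import comb


def turan_3_2(n: int) -> int:
    """Exact Turan number T(n, 3, 2) = floor(n/2) * (ceil(n/2) - 1)."""
    return (n // 2) * ((n + 1) // 2 - 1)


def upper_side(n: int) -> int:
    """2 T(n-2,3,2) + T(n-3,3,2) + n - 3, from the upper bound on S(n, 3)."""
    return 2 * turan_3_2(n - 2) + turan_3_2(n - 3) + n - 3


def lower_side(n: int) -> int:
    """2 C(n-3, 2) + 1, from the assumed uncovered 3-set."""
    return 2 * comb(n - 3, 2) + 1


# The inequality upper_side >= lower_side holds for n = 6, 7, 8 and fails from n = 9 on.
for n in range(6, 13):
    print(n, upper_side(n), lower_side(n), upper_side(n) >= lower_side(n))
```

The inequality still holds for $n = 6, 7, 8$ (with equality at $n = 8$), which is why those cases need a separate verification.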



Summarizing Theorem 13 and results in [8], we make the
following conclusion.

Proposition 4 If $C$ is an $[n, n-d+1, d]$ MDS code, and
$1 < d \le 5$, then

$$\rho(C) = S(n, d-2).$$

Proof: For $(n,d) \in \{(4,4), (5,4), (5,5)\}$, it is easy to
find that $S(4,2) = 3$, $S(5,2) = 5$, $S(5,3) = 4$, and verify
that $\rho(C) = S(n, d-2)$ in all three cases. Otherwise, since
$T(n, d-1, d-2) \le \rho(C) \le S(n, d-2)$, it follows immediately
from Theorem 13 and [8, Theorem 14, Theorem 16] that

$$\rho(C) = S(n, d-2) = T(n, d-1, d-2)$$

in all other cases. ■
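The three small SE numbers quoted in the proof can be reproduced by exhaustive search. The sketch below is a brute-force illustration (only practical for very small $n$), using the definition of an $(n,t)$-SE system from the abstract: for every $i$-subset, $i = 1, \ldots, t+1$, some block contains all but one of its elements. The function names are illustrative and not from the paper.

```python
from itertools import combinations


def is_se_system(blocks, n: int, t: int) -> bool:
    """True if every i-subset (1 <= i <= t+1) misses exactly one element in some block."""
    return all(
        any(len(set(x) - b) == 1 for b in blocks)
        for i in range(1, t + 2)
        for x in combinations(range(n), i)
    )


def single_exclusion_number(n: int, t: int) -> int:
    """Smallest size of an (n, t)-SE system, found by exhaustive search."""
    candidates = [frozenset(c) for c in combinations(range(n), t)]
    for size in range(1, len(candidates) + 1):
        for blocks in combinations(candidates, size):
            if is_se_system(blocks, n, t):
                return size
    raise ValueError("no SE system exists for these parameters")


print(single_exclusion_number(4, 2))  # 3
print(single_exclusion_number(5, 2))  # 5
print(single_exclusion_number(5, 3))  # 4
```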
In addition, from Corollary 2 and Corollary 3 we see that
as $n \to \infty$, if $d = o(\sqrt{n})$, or if $k = o(\sqrt{n})$, $k \to \infty$, then

$$\rho(C) \sim S(n, d-2).$$

So in these cases we may say that the Schwartz-Vardy
conjecture holds in the asymptotic sense.

Our approach regarding the conjecture has been so far to
show that in some cases the upper and lower bounds on
stopping redundancy (SE and Turán numbers, respectively)
converge, either exactly or asymptotically. However, we have
seen that in other cases the corresponding SE and Turán
numbers can be provably different, even in the asymptotic
sense (for example, when $k$ is a fixed constant), which shows
the limitation of the current approach in fully resolving the
conjecture.

In fact, it is our belief (cf. [18]) that for an $[n, n-d+1, d]$
MDS code $C$,

$$\rho(C) = S(n, d-2),$$

the proof of which would in turn prove the Schwartz-Vardy
conjecture. We have shown that this is true (or close to being true)
when either $d$ or $k$ is $o(\sqrt{n})$. A reasonable question to ask
is: what if both $d$ and $k$ are larger than $o(\sqrt{n})$? For example,
what if $k/n$ approaches a constant? The current approach only
bounds $\rho(C)$ to within a factor of up to $\ln n$. For example,
using the result of Theorem 8 we have

n-d + 2 l + ln(n-d+l) 



n 



1 



n 
d-2 



while, for the lower bound, we saw that

$$\rho(C) \ge \frac{1}{d-1} \binom{n}{d-2}.$$



Alternatively, let us make the following observation. Suppose
we are given one optimal parity-check matrix, i.e., one with
$\rho(C)$ rows that maximizes stopping distance. It is not apparent
that all rows should have minimum weight, but suppose $T'$
rows are of minimum weight and the rest are not. We can
replace each row that is not of minimum weight (and whose
weight is, of course, at most $n$) with no more than
$\lceil n/(n-d+2) \rceil$ minimum-weight rows, such that the union of supports of
these rows is precisely the support of the row they replaced.
It is simple to verify that the replacement procedure does not
decrease the stopping distance, which also implies that the
rank of the matrix is not reduced. After all rows that are not
of minimum weight have been replaced, we obtain a parity-check
matrix with at most

$$T' + \left\lceil \frac{n}{n-d+2} \right\rceil (\rho(C) - T')$$

rows, all having minimum weight, that achieves maximum
stopping distance. Therefore,

$$T' + \left\lceil \frac{n}{n-d+2} \right\rceil (\rho(C) - T') \ge S(n, d-2).$$

Now note that

$$T' \ge T(n, d-1, d-2),$$

so we have

$$\rho(C) \ge \frac{S(n, d-2)}{\left\lceil \frac{n}{n-d+2} \right\rceil} + \frac{\left\lceil \frac{n}{n-d+2} \right\rceil - 1}{\left\lceil \frac{n}{n-d+2} \right\rceil} \cdot T(n, d-1, d-2).$$

Without knowing better how $T(n, d-1, d-2)$ compares with
$S(n, d-2)$, if we just ignore the second term, we obtain

$$\frac{S(n, d-2)}{\left\lceil \frac{n}{n-d+2} \right\rceil} \le \rho(C) \le S(n, d-2).$$

This shows that in many cases $S(n, d-2)$ is a good estimate of
$\rho(C)$. For example, if the code rate $R = (n-d+1)/n \ge 1/2$,
then

$$\tfrac{1}{2} S(n, d-2) \le \rho(C) \le S(n, d-2).$$

And, clearly, for any constant code rate, $\rho(C)$ is within a
constant factor of $S(n, d-2)$.
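The size of the window around $\rho(C)$ is governed entirely by the factor $\lceil n/(n-d+2) \rceil$. The sketch below computes that factor and the resulting window for a given value of $S(n, d-2)$. The function names are illustrative, and rounding the lower end up to an integer is an addition of this sketch, justified only by $\rho(C)$ being an integer.

```python
def replacement_factor(n: int, d: int) -> int:
    """ceil(n / (n - d + 2)): max number of minimum-weight rows replacing one row."""
    min_weight = n - d + 2  # minimum weight of a parity check for an MDS code
    return -(-n // min_weight)  # integer ceiling division


def rho_window(se_number: int, n: int, d: int) -> tuple[int, int]:
    """(lower, upper) bounds on rho(C), given S(n, d-2) = se_number."""
    c = replacement_factor(n, d)
    return -(-se_number // c), se_number


# Example: the [5, 2, 4] case with S(5, 2) = 5 gives the window (3, 5).
print(replacement_factor(5, 4), rho_window(5, 5, 4))

# For rate R = (n - d + 1) / n >= 1/2 the factor never exceeds 2.
print(all(
    replacement_factor(n, d) <= 2
    for n in range(2, 200)
    for d in range(2, n + 1)
    if 2 * (n - d + 1) >= n
))
```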

V. Concluding Remarks

While we have obtained a fairly good understanding of the
SE number, and in some cases come close to uncovering the
true value of the stopping redundancy of MDS codes along
the way, many interesting questions remain unanswered. For
example, what is the asymptotic value of $S(n, n-k-1)$ for
a fixed $k$? (Is it asymptotic to $\frac{1}{k}\binom{n}{k}$?) And how does $S(n,t)$
compare with $T(n, t+1, t)$ in general? (Do they differ by
at most a constant factor?) Finally, is it true that the stopping
redundancy of an $[n, n-d+1, d]$ MDS code equals $S(n, d-2)$
(as we conjectured in [18])?

Appendix A
ASYMPTOTICS OF (8) FOR $0 < t \le n - \ln n$



Rewrite (8) as



t+i 



(14) 



where 



Assuming 



/(*) 



Vn-t-l/ 



$n - t \ge \ln n$,
we show that the $(t+1)$-st term,

$$f(t+1) = (n-t)^{-(t+1)/t},$$

prevails in the sum of (14).

It suffices to show that each of the other $t$ terms is an $o(1/t)$
fraction of $f(t+1)$. For $i = t$, it is easy to verify that

fit) ^ i±i . (n - t)-("^*) = o{l/t)f{t + 1). 
In general, we have 

For $i = 1, \ldots, t-1$, consider two cases. For $i < 2n/\ln n =
o(n)$, we have



> 



In this case, (15) decreases super-exponentially with $n$, and is
certainly $o(1/t)$. On the other hand, for $2n/\ln n \le i \le t-1$,
we show that $f(i) \le f(t) = o(1/t) f(t+1)$, by showing that
$f(i)$ is monotonically increasing for $2n/\ln n \le i \le t$. Indeed,
for $2n/\ln n \le i \le t-1$, we have



f{i) I + 1 

n — i 
In n 



> 



> 



> 1. 



• (In n) * ( 1" " In"" ) ('"2") 



'In n 



In n 



Appendix B

ASYMPTOTICS OF THE BOUND (3) FOR $t > n - \ln n$

Let $k = n - t - 1$, and $j = t + 1 - i$. The right-hand side
of (3) becomes



Let $p = (\ln n)/n$. Assume that



1 <k^ 



i In In 5 
Inn 



($n - \ln n < t \le n - 2$ being a special case). We show that the
first term in (16) prevails. Note that



n J 

n-k-j tk+j\ 

= n " \ I' ) . 



For $1 \le j \le n\sqrt{2}/\ln n$, we have



For $j > n\sqrt{2}/\ln n$, we have



In n , , s /fc + A In n /i + 2^ 



Noting further that for j > 1, 



we see that the second term in (16) is at most




Hence, at p = (In n) /n, the upper bound ^ is asymptotic to 
for all t such that 2 < n - t = o((nlnlnn)/lnn), 
which implies that for 2 < n — t = o((n In Inn)/ Inn), the 
upper bound ([3]l, when minimized over p, is 

Note that the O ( ^ (") ) estimate is not always tight. For 
example, when t ^ n — Inn, we have shown using a different 
analysis that the upper bound ([3]), when minimized over p, is 
in fact However, note that by keeping just the 

i — t term in the sum, the upper bound ([3]) is at least 

(p+(l-p)*("-)) 

(1 - (1 - 1/a) . Q 



> 



In a 
a 



O 



where a = t{n — t). In particular, this shows that the 



O 



' In n 



estimate is tight if n—t is 0(1). That is, the bound 



(|3]l is ( (") ) when minimized over p, for alH = n— ( 1 ) . 